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Abstract 

The quantum search algorithm is a technique for searching N possibilities in only O( jN) steps. Although the 
algorithm itself is widely known, not so well known is the series of steps that first led to it, these are quite different 
from any of the generally known forms of the algorithm. This paper describes these steps, which start by discretizing 
Schrodinger's equation. This paper also provides a self-contained introduction to quantum computing algorithms 
from a new perspective. 

1. Introduction Consider the following problem from a crossword puzzle: 

r _ n h _ (Solution - piranha) 

You have an online dictionary with 1, 000, 000 words in which the words are arranged alphabetically. You could pro- 
gram it to look for the solution to the puzzle so that it typically solves it after looking through 500, 000 words. It is 
very difficult to do much better than this. But that is: only if you limit yourself to a classical computer. A quantum 
computer can be in multiple states at the same time and, by proper design, can carry out multiple computations simul- 
taneously. In case the above dictionary were available on a quantum computer, it would be possible to carry out the 
search in only about 1, 000 steps by using the quantum search algorithm. 

The quantum search algorithm has evoked considerable interest among both physicists and computer scien- 
tists. This was the first important application of quantum computing that did not require the problem under consider- 
ation to have a structure and was hence applicable to several different types of problems in both physics and computer 
science. Also the framework was simple and general and could be extended to different problems and different phys- 
ical situations. 

It is unusual to write a paper listing the steps that led to a result after the result itself is well known. This is usu- 
ally of historical interest and better left to the philosophers of science. In this case, more than five years after the ini- 
tial algorithm was invented, the series of steps that led to it are still not known even by the scientists in the field. I 

recently presented an outline of this story to a small gathering of theoretical physicists 1 . After seeing the interest it 
inspired, I decided to write this more complete version. 

l(i). Quantum Mechanics The quantum search algorithm needs only a small fraction of the conceptual 
machinery of quantum mechanics. This subsection briefly mentions the concepts needed to understand the quantum 
search algorithm - it is by no means a comprehensive review of quantum mechanics. 



1 . A preliminary version of this paper was presented in the Winter Institute on the Foundations of Quantum 
Theory, SN Bose Center, Calcutta, India in Jan. 2000. 

2. This material is based upon work supported in part by the NSA and ARO under contract no. DAAG55- 
98-C-0040. 
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A classical binary switch can be either ON or OFF. The two possibilities, ON / OFF, are referred to as basis 
states or sometimes just as states for short. The switch can be completely described by specifying which basis state it 
is in - whether it is ON or OFF. On the other hand a quantum mechanical system is associated with all possible basis 
states at the same time with certain probabilities - it can be simultaneously ON and OFF. Its specification requires us 
to specify the probabilities in all basis states. Further, as the system evolves, the probabilities in the various basis 
states interact with each other in complicated ways. This already gives some feeling of the complexity, and potential 
computational power, of quantum mechanical systems. 

In order to harness the power of quantum mechanics, it is necessary to know how the probabilities in the vari- 
ous states interact with each other. The way to describe these probabilities is in "wave" terms, by a quantity called the 
amplitude. The amplitude is a complex number, with both a magnitude and a phase in each state. The specification of 
the amplitudes in each state is called a superposition or a state vector. The overall probability in each state, just like 
the intensity of a wave, is given by the absolute square of the amplitudes in that state. For example, consider a four 

1 1 1 



state system. Denote the basis states of the system by |0), |1), |2) and |3) . Let the amplitudes in these states be 



2' 2' 2 



and - ^ respectively. The probability in each of the four states is ^ , the state vector is 



which is also denoted as 



||0) + + ^|2)-||3). Note that the probabilities in each state would stay the same even if the state vector was 

||0) + ^|1) + ^|2) + ^|3) , however the state vector is very different and the evolution of the state vector in time will 

now be very different. In fact, the whole reason for describing the system in terms of amplitudes is that the evolution 
is better described in terms of amplitudes as opposed to probabilities. 

Any physical process, whether classical or quantum mechanical, must evolve in time in a way so that it con- 
serves the total probability. In Markov processes this requirement leads to the condition that the entries in each col- 
umn of the state transition matrix sum to one. For quantum mechanical processes, this requirement translates into the 
condition that the state transition matrix be unitary, i.e. the columns of the state transition matrix be orthonormal. An 



example of a 4 x 4 unitary state transformation matrix is: - 



1111 
1 1-1-1 
1-1-1 1 
1-1 1-1 



This transforms ±|0> + ^ |1> + ±|2> + ±|3> 



1111 1111 
into |0) and it transforms -|0> + + -|2)- -|3) into -|0> + -|1)--|2) + -|3> . Even though the initial superposi- 
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tions had the same probabilities in each state, after the quantum mechanical transformation the probabilities in the 
two cases become very different. 

The transformation of amplitudes by any unitary operation is always linear. It is hence enough to specify how 
the basis states are transformed. The transformation of any superposition can be obtained from this simply by sum- 
ming the transformations of each of the components, i.e. if M be any operator, |A) and \B) be any two state vectors, 
then: M(a\A) + p|B)) = aM\A) + pM|B) . This is called the superposition principle and is the reason that in Dirac 
notation a superposition is denoted as an appropriately weighted summation of the components. 
l(ii). Quantum Computing Just as classical computing systems are synthesized out of two-state systems 
called bits, quantum computing systems are synthesized out of two-state systems called qubits. The difference is that 
a bit can be in only one of the two states at a time, on the other hand a qubit can be in both states at the same time. For 

example, consider two qubits, each in the superposition: ( -p|0) + ] . This is represented as the tensor product: 



72 72 

-7=|0) + ] ® f -7=|0) + -7=|1) ) or equivalently we have a four-state system in a superposition: 

72 72 ) v72 72 J 

i|0> <E> |0> + i|0>® |1) + ||1>® |0) + ||1>® |1> which is also represented as ||00> + ^|01) + ^|10) + ^|11) . This is 

an example of a superposition that can be factored into smaller systems. However, most superpositions, such as 
1111 

- 100) + -|01)--|10) + -|11), cannot be represented as a product of smaller superpositions. The general description 

Yl Yl Yl 

of an n qubit system (which is a 2 state system) requires us to specify the amplitude in each of 2 states, i.e. 2 
quantities; whereas to specify a classical n bit system requires just n bits of information. 

Interesting and paradoxical quantum mechanical effects arise in systems that cannot be factored into smaller 
systems. Such systems are called entangled. Quantum computation algorithms, such as quantum search, make use of 
entanglement to devise fast algorithms. 

In order to design quantum computing systems, we need a basic set of building blocks analogous to the 
NAND and NOR gates that are used to build classical digital systems. Unfortunately, by writing out the transforma- 
tion matrices for NAND and NOR, it is easily seen that they are not unitary and hence cannot be implemented quan- 
tum mechanically. Fortunately, there exists a set of unitary transformations that can be implemented quantum 
mechanically, using which it is possible to design a circuit that will output any desired boolean function. This is the 
following set of three transformations: 

(i) NOT - a one-input one-output gate. Since the input and output are both qubits, this gate transforms a two state 
system into another two state system. The transformation swaps |0) and |1) . In matrix terms this is described by the 



transformation 



1 

1 



where the basis states are |0) and |1) respectively. 



(ii) CNTRL-NOT - a two-input two-output gate: the input and output both consist of two qubits - hence this is a 
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transformation from one four-state system to another four-state system. It transforms |00) into |00) ; |01) into |01) ; 



|10) into |11); |11) into |10). The transformation matrix for this is 



10 
10 
1 
10 



, where the basis states are |00) , 



|01) , |10) and |1 1) respectively. The name CNTRL-NOT arises since the first bit acts as a control for the second - if 
it is a 1, it swaps and 1 in the second bit. It must be emphasized that this simple-minded verbal description only 
holds for the basis vectors. The transformations of superpositions are more complicated and can only be obtained by 
the state transition matrix described above. 

(iii) CNTRL-CNTRL-NOT - a three-input three-output gate: the input and output both consist of three qubits - hence 
this transforms one eight-state system to another. It transforms |000) into |000) ; |001) into |001) ; |010) into |010) ; 
|011) into |011); |100> into |100); |101> into |101>; |110> into |111); |111> into |1 10) . Just like (i) and (ii), this too 
can be listed out as an 8x8 matrix transformation (an 8x8 identity matrix with the last two columns swapped). 
The name CNTRL-CNTRL-NOT arises since the first two bits act as control bits for the third - if both are 1, it swaps 
and 1 in the third bit. As mentioned before, this simple verbal description only holds for the basis vectors - for 
superpositions, we need to use the state transition matrix. 

The unitarity of these three operations is easily verified by noting that the column vectors of the transformation 
matrices are orthonormal (this is clearly the case since the gates merely permute the basis states). Using these three 

gates it is possible to design a circuit such that one of its outputs is any specified boolean function f(x) . This needs 
approximately the same number of gates as would be required in a classical implementation using NANDs and 
NORs. Such a circuit maps a superposition that is concentrated in any single input basis state, x , to another superpo- 
sition concentrated in a single output basis state f(x) . Transformations of general superpositions are obtained from 
the superposition principle. Note that to synthesize boolean functions quantum mechanically, we need three basic 
gates whereas in the classical case we needed just two (NAND and NOR). 

In order to develop more powerful quantum mechanical algorithms, in addition to these three gates, we need 
some operations that are essentially quantum mechanical and have no classical analog, i.e. the entries of the state 
transition matrix are not all 0's and l's. Two such operations that we need in the quantum search algorithm are the W- 
H transformation operation and the selective inversion operation, these are discussed in the following paragraphs. 

A basic operation in quantum computing is the operation M performed on a single qubit - this is represented by 



the following matrix: M=-^= 

J2 



1 1 
1 -1 



M is unitary since the columns are orthonormal. Also note that MM = I . If 



we consider an n qubit system, we can perform the transformation M on each qubit independently in sequence, thus 

Yl 

transforming the state of the system. A system consisting of n qubits has N=2 basis states, so the state transition 
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matrix representing this operation is of dimension 2x2. Consider the case when the starting state is one of the 2" 
basis states, i.e. it is described by an arbitrary string of n binary digits composed of some Os and some Is. The result 
of performing the transformation M on each qubit will be a superposition of states consisting of all possible n bit 



binary strings with amplitude of each state being ±2 . This transformation is referred to as the W-H transformation 

T 

and denoted by W . Note that since MM = I , it follows that WW = I . Also note that, since M = M , it follows 

that W= W T .A generalization of this is the Quantum Fourier Transformation, which leads to applications such as 
factorization. 

The other transformation that we need is the selective phase inversion of the amplitude in certain marked 
states. The transformation matrix describing this for a four state system with selective phase inversion of the second 



state is: 



10 
0-100 
10 
1 



. This is clearly unitary. Unlike the W-H transformation, in this case, the probability in each state 



stays the same. 

Consider a binary function f(x) that is either or 1 . A quantum mechanical circuit can be designed to 

invert the amplitude in the set of basis states, x , for which f(x) = 1 - provided we are given a quantum mechanical 

black box that evaluates the function f(x) for any specified x such as that in Figure 1 . Note that we do not need to 

know in advance which x makes the function equal to 1 - all we need is a circuit which for an arbitrary x , can tell us 

whether or not f(x) is 1 . Given such a quantum mechanical black box, a selective inversion transformation can be 
realized by using this box along with some of the gates discussed so far - this is described in the circuit in Figure 5 (in 
appendix). 



X 




X 



Figure 1 - A quantum mechanical "black box" that evaluates f(x) . 
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l(iii) Quantum Search As mentioned in the introduction, in the quantum search algorithm we are given a cer- 
tain condition and we need to find out which one of the specified N possibilities satisfies this. This problem can be 
represented as a binary function f(x) defined over N basis states (denoted by x ), f(x) is known to be 1 at a single 
value of x , say t (t for target) - the goal is to find t . Without any other information about the structure of f(x) , it 
N 

would take an average of — function evaluations to solve this problem on a classical computer. [1] found a quantum 

mechanical algorithm that took only O(Jn) steps. 

This was of considerable interest since, by properly defining the function f(x) , a number of different prob- 
lems, such as the crossword puzzle mentioned in the introduction, could be cast in this form. All that is needed is a 
quantum mechanical box for evaluating the function f(x) . By putting a small amount of hardware around this box it 
is possible to search N possibilities in only 0(«JN) steps. The only information about f(x) needed is that it is a 
binary function that is either or 1 and that it is 1 at only a single point in the domain. 

The quantum search algorithm consisted of Jn repetitions of the operator -I-WIjW starting with the basis 

state where all qubits are - this basis state is denoted by . W denotes the Walsh-Hadamard (W-H) transformation, 
I j denotes the selective phase inversion of the target state t where the function f(x) evaluates to 1, /- denotes the 

selective phase inversion of . This creates a superposition all of whose amplitude is in the basis state t . A measure- 
ment then immediately identifies t . 

It is easily possible to carry out the algebra and verify that the evolution of the system is such that in every rep- 
etition of -I-WIrW, the amplitude in the t state rises by 0[ -j= ] [1]. Unfortunately, this calculation does not give 



much insight into how such an algorithm could have been first invented. The algorithm was first presented in a con- 
ference in terms of a diffusion transform [2], which is similar to the way it was invented, but the significance of this 
was not usually appreciated. Later on, there were more interpretations: inversion about average, rotation in a two 
dimensional Hilbert space, antenna array [3]. All of these describe some aspect of the algorithm, but they are very dif- 
ferent from the way it was initially invented. 

2. Schrodinger's Equation Quantum mechanics was historically developed in the context of atomic 
physics. One of the first successful descriptions of atomic phenomena was the Schrodinger's Equation which was dis- 
covered by Erwin Schrodinger in 1926 and to this day it continues to be the most commonly used description for non- 
relativistic phenomena. This was the aspect of quantum mechanics that I was familiar with when I started investigat- 
ing quantum algorithms in 1995. 

In Schrodinger's framework, the basis states are continuous and uniformly distributed in space. Instead of the 
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discrete states |0) and |1) , there is a continuum of states denoted by \x) where x is a continuous variable. The state 
vector is specified in the form of a function V(/(x) which denotes the amplitude in the state \x) (this function is usu- 
ally referred to as a wavefunction). Schrodinger's Equation describes the evolution of the wavefunction V|/(x) in time 
for actual physical systems which are described by potential functions. 

Let us begin with Schrodinger's Equation in one dimension (leaving out the scaling constants in order to focus 

a d 2 

on the nature of the equation): i \\i(x, t) = - \\i(x, t) + V(x)\\i(x, t) , which may be written as 

at -i 2 

dx 

d d 2 

— \|/(x, t) = i \|/(x, t)-i : V(x)\|/(x, /) . This describes the evolution of the wavefunction \|/(x, t) in the presence of 

at 2 
dx 

a potential function: V(x) . 

This is analogous to the diffusion equation with absorption. If we break up the evolution into infinitesimal 
time steps of size dt and imagine that each point is connected to its two adjacent neighbors on an infinitesimal grid of 

size dx, then the derivatives in Schrodinger's equation can be written as: ^-\\f(x, t ) = ^ X ' t + ^ — — and 

at dt 

2 

d , \ \\f(x + dx, t) + \\f(x - dx, t) -2w(x, t) , . .„ , _ . . 

— - \(/(x, t) = . It simplifies notation if we define a constant e such that 

dx {dx) 

dx= — . Substituting the definitions for the derivatives into Schrodinger's equation, we obtain the evolution of the 
V e 

wave function with time, i.e. \\i(x, t + dt) = ( 1 - iV(x)dt - 2i£)\\f(x, t) + ie(\\t(x + dx, t) + \\i(x - dx, t)) . 

In the Schrodinger's equation there is a continuum and therefore an infinite number of states; however in 
order to demonstrate the principle, assume just a 4-state system where the states are arranged in a loop with diffusion 
from each state to its two adjacent states. [\|/(x, t)] is a column vector with 4 components. Then in accordance with 
Schrodinger's equation the evolution for a time dt may be represented as a state matrix transformation as follows: 

1 - iV{x^)dt-2iz ;'e ie 

i'e 1 - i V(x 2 )dt-2ie ;'e 

/e 1 - iV(x 3 )dt-2iz ie 

/e /e 1 - iV(x 4 )dt-2ie 



[\\i(x, t + dt)] = 



[\|f(x, t)] 



It may be verified that this is a unitary transformation in the limit of infinitesimal dt and infinitesimal e , i.e. the mag- 

2 2 

nitude of each column vector is 1 + 0((dt) ) + <9(e ) and the dot product of any two column vectors is 
0((dt) 2 ) + <9(e 2 ) . 

The above state transition matrix may be represented approximately as follows: 

[\\i(x,t+ dt)] = DR[\\i(x, t)] , where: 
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D = 



l-2;'e ie z'e 
/e 1-2/e ie 
ie 1-2/e /e 
ie ie 1-2/e 



exp(-iV(x 1 ),if) 

exp(-iV(x 2 )dt) 

exp(-iV(x 3 )dt) 















exp(-;' V(x^)dt) 



2 2 

The representation is accurate up to terms 0((dt) ) + (3(e ) . Also note that R is exactly unitary, but D only 

2 

approximately so, up to 0(e ) . 

The sum of the entries in each column of D is unity so it is like a Markov diffusion process with imaginary 
state transition probabilities (D is short for Diffusion Transform). This Markov diffusion feature will be made use of 
later in the design of the algorithm. 

Repeating the infinitesimal state transformation, we obtain the transformation for finite times: 

.(DR)(DR)(DR)[y(x, 0)] 



x)] = (DR) — repetitions. 

Note that D and R do not commute, which is why in the equation above they have to be carefully listed out in the 
indicated order. In theoretical physics this technique for building up finite transformations out of infinitesimal non- 
commuting operations is called Trotter's formula. 



3. Infinitesimal Quantum Search By analogy with a classical situation, we know that if we start with 
a uniform superposition in a line and let it evolve in the presence of a potential function, it will gravitate towards 
points at which the potential is lower (see Figure 2.) 




tttttttttt ttlttlt 



Figure 2 - Just as the balls roll down into regions of lower potential energy, a uniform quantum 
superposition evolving under Schrddinger's equation will gravitate towards lower potential energy regions. 

Therefore, in order to design an algorithm that will reach certain marked states, one can give these marked 
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states a lower potential and then implement the same iterated transformations obtained from the evolution of 
Schrodinger's equation. We assume a single marked state. Assume we do not know which one this is, but we have a 
means of selectively rotating its phase by any desired amount (the implementation of such a transformation is dis- 
cussed in section 4). Using such a phase rotation transformation, R , we design an algorithm using the analogy with 
Schrodinger's equation. 

Consider the same sequence of infinitesimal transformations used to represent the evolution of a system in 
the presence of a potential (last equation of the previous section). In the design of a quantum algorithm an immediate 
improvement is obtained by noticing that we can connect each state to all other states, instead of just to states that are 
adjacent on the grid, thus synthesizing an algorithm of the type: 

[ ¥ (x,T)] = (DR) (DR)(DR)(DR)[y(x,0)] 



where: 1 D = 



(1-iNe) 


ie 


ie 


ie 




1 





. 


. 


ie 


(1-iNe) 


ie 


ie 







exp(i'y) 


. 


. 


ie 


ie 


(1-iNe) . 


ie 


and R = 








1 . 


. 


ie 


ie 


ie 


. (l-iNe)_ 










. 


. 1 



Note that according to the state-transition transformation derived from Schrodinger's Equation, positive values of y 
correspond to negative potentials. The question is regarding whether such a series of transformations will really drive 
the system into the marked state and if so how large e and y should be and how many times the (DR) operation has 
to be repeated. 

Assume [v(/(x, 0)] to be the uniform superposition, that is, a wavefunction with equal amplitudes in all 
states. The effect of R is to rotate the phase of the target state by y . Note that the sum of entries in each column of D 
is unity, hence the operation D is analogous to a Markov diffusion process with a transition probability from any 
state to any other state being ie . This analogy simplifies the analysis and design of the algorithm. It immediately fol- 
lows that there is no net exchange between states that have the same amplitude since the transfer in the two directions 
cancels out, the only net exchange is between the state with the rotated phase and each of the other states. 

K 

The increase of amplitude in the marked state due to D is maximum when R initially rotates its phase by - - 

the increase of amplitude is then proportional to e as shown in Figure 3. Let the amplitudes in the unmarked and 
k iK 

marked states be — and — respectively. Initially k and K are both 1 but as the amplitude in the marked state 



increases, K rises to O(jN) and k shrinks. 



1 . For an A^-state system, in order for the terms in each row to sum to unity, the diagonal terms of D should 
be ( 1 -i (N - 1 )e) . However, for simplicity, we write this as ( 1 -iNe) . For N large, the error due to this is 
negligible. 
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Summing the transfers between each of the unmarked states and the marked state, it follows that after D , the 
amplitude in the marked state becomes -^7= + ^=Nt + -^=Nt and the amplitude in each of the other states becomes 

Jn Jn Jn 

k K ilc 

—— - — — e - — — e . Assuming k to be of order 1 (as is the case initially), it follows that there is a net change of mag- 

Jn Jn Jn 

Nt 

nitude of approximately — in the amplitude of the marked state. Also, the phase of the marked state gets rotated by 
JN 

approximately Ne , the magnitude and phase in the unmarked states stay approximately the same, (k , the magnitude 
of the amplitudes in the unmarked states, is easily estimated by a conservation of probability argument, i.e. the quan- 

2 K 2 

tity k + — is the total probability and is always 1 . Therefore, as long as K < JN/2 , k lies in the range 
l/2<k< 1). 

ie 



A 

Jn 




The tilt of this amplitude denotes that its phase 
is rotated relative to other amplitudes. 



Figure 3 - The diffusion operation (D) results in a net transfer of amplitude proportional to £ from each state to 
the marked state whose phase has been rotated by R. There is no net transfer between states with equal amplitude. 

The previous paragraph describes how it is possible to obtain a transfer into the marked state even without 
knowing which one it is, provided we can somehow rotate the phase of the marked state and subsequently apply a dif- 
fusion operation, D, that transfers amplitude proportional to ie between any two states. The analysis of the previous 
paragraph suggests that the maximum transfer is achieved by keeping e as high as possible which might accomplish 
the entire transfer in a single operation. The problem, however, is that the matrix D, as defined in the previous section, 

2 

is no longer unitary for large e . If we look at the columns, their dot product is 0(Ne ) and the sum of the squares of 
magnitudes in each column is 1 + ') . D is therefore only approximately unitary provided £ « — . 

The transfer of amplitude into the state with the rotated phase was estimated as — . Assuming £ = O^— J > 

this transfer becomes of-p^l . The rotation of the phase of the marked state which was A^£ , becomes of order 1. If 



-JNJ 

we adjust the R matrix to unrotate the phase by this precise amount, then we can repeat the D operation to obtain an 
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additional transfer. It follows that in 0( *JN) repetitions of the (DR) cycle, the amplitude and thus probability in the 
marked state will become a quantity of order 1. 



4. An Exactly Unitary Diffusion Transformation in the previous section, there is the tricky 

matter of precisely how high e can be. The issue is regarding the fact that the matrix D is unitary only up to 

2 2 1 
0(N e ) and therefore e is required to be much smaller than — . The number of steps required by the algorithm is 

which can therefore at best be a large constant times Jn - the precise scaling factor will depend on precisely 



how much e is smaller than 



We could carry out a perturbation to make D unitary to higher powers of e thus making it possible to make 
e higher. Instead, this section will create a diffusion matrix that is exactly unitary. 



Consider D = 



abb. 
b a b . 
b b a . 



where a and b are complex constants to be determined based on the uni- 



b b b ... a_ 

tarity condition. Note that the structure of this matrix immediately implies that the sum of the entries in each column 
be exp(/A-) where X is real (if X had an imaginary component, it would imply that the sum of the amplitudes would 
increase or decrease exponentially as the wavefunction evolved). Therefore by performing a phase rotation transfor- 
mation on D we can always transform it to a form where the sum of the entries in each column add to 1 . 

The condition that the sum of the absolute squares of each column vector of D is unity, gives equation (i) 
below and the condition that any two column vectors of D be orthogonal yields equation (ii). 



\a\ 2 + (N-l)\b\ 2 = 1 



(i) 



2Real(ab ) + (N-2)\b\ = (ii) 

The case previously considered in section 3 (a = 1 - ;We ; b = ie) satisfies these two conditions approxi- 
mately provided e « ^ . We would like to make \b\ as high as possible in order to attain the maximum transfer in each 



step. This section shows that \b\ can be made as high as 



N' 



There is some freedom in the choice of a and b since there are 4 variables (the real and imaginary portions 
of a and b) and only 2 equations. One of the degrees of freedom represents the fact that if we have any solution (a, b) 
where a and b are complex, then (aexp(;'(j)), bexp(i§)) is also a solution for any real (j) . Therefore, we can choose 
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the phase of one of the variables arbitrarily, say we choose a to be real. The equations (i) and (ii) then become: 
a 2 r + (N-l)(b 2 r +bf) = 1 (La) 

2a r b r +(N-2)(b 2 r + b 2 ) = (ii.a) 

2 

Substituting for a r in terms of \b\ from (i.a) into (ii.a) and squaring both sides, we obtain: 



(N-2) 2 \b\ 4 =4b 2 
1 _ (^V_ 



<4\b\ Z 

2 2 2 

This leads to the bound \b\ < — which implies \b I < — . Trying out b = — b- = indeed gives a solution with 

2 

a = - 1 + — and D becomes: 
N 



D = 



-1 + 

2 
N 

2 
N 

2 
N 



N 



2 
N 

1 + 

2 
N 

2 
N 



N 



2 
N 

2 
N 

1 + 



N 



2 
N 



2 
N 

2 
N 

2 
N 



1 + 



N 



Note that as mentioned in the beginning of this section, the modulus of the sum of the entries in each column is 
indeed 1; hence it too may be viewed as a Markov Process with transition probability from any state to any other state 
2 

being — . This fact will help in the design of the quantum search algorithm in the next section. 



5. The Quantum Search Algorithm it is easily seen that the maximum transfer into the marked state 
that can be accomplished by the matrix D happens when the phase of the marked state is rotated by K relative to the 
other states. This is illustrated in Figure 4, contrast this to the situation in section 3 where the maximum transfer was 



attained when the phase of the marked state was rotated by - . 
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2/N 

— A 

V 2/N ' 

) jn 

Figure 4 - The diffusion transformation (D) results in a transfer of amplitude to the selected state whose phase is 
rotated by n by the R transformation. There is no net transfer between states with equal amplitude. 

Assume the amplitude in the marked state to be - — , where K « Jn , and the amplitude in each of the 

Jn 

k 

other states to be — — where k < 1 . As a result of D , as shown in Figure 4, there is a net transfer of 

Jn 

2 ( k K \ K 
N x - — — + — — into the marked state. When we add the initial amplitude of - — — , the total amplitude in the 

N yjN Jn) Jn 

marked state after D becomes (K+2k) ^j m jj ar j a( jding the initial amplitude to the amplitude transferred from the 

Jn 

k 2fk\ 2K k f I \ 

marked state, the amplitude in each of the other states becomes — — - — — — - — — which is — — - O — . There- 

Jn nVJnJ n Jn In) 

K ( K + 2.k) 

fore the net result of the operation is to change the amplitude in the marked state from -— to — — while leav- 

Jn Jn 

ing the amplitudes in the unmarked states approximately the same. 

After this, if the amplitude in the marked state is inverted by R , the transformation D can be used again to 
2k 

further increase the magnitude by — . As in section 3, k stays virtually unaltered and is of order 1, it follows that in 

Jn 

O(Jn) repetitions of the (DR) cycle, the amplitude in the marked state gets boosted to a quantity of order 1. 
The algorithm can hence be described by the following sequence of transformations: 

lV flnal W] = , (DR) (DR)(DR)(DR) [y init W] 

JN reps 

Here [v|/- M is the uniform superposition with equal amplitude of -j= in all N states, [¥fi„ a /( x )] is a superpo- 

Jn j 

sition which has an amplitude of order 1 in the marked state and small, uniform amplitudes in all other states. The 
matrices D and R are summarized below. 
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A 7 
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6. Synthesizing D The previous section described the matrix transformations which, when carried out, result 
in a significant amplitude being obtained in the marked state. These were NxN matrix transformations and we still 
need to show how to synthesize these by means of 0(logN) two dimensional rotations and one dimensional reflec- 
tions of the type discussed in section l(ii) since these are the technologically feasible quantum computing operations. 

By using the fact that WW = I with W as defined in section l(ii), the transformation matrix D as defined 
at the end of the previous section can be written in the following form: W(WDW)W. Substituting for D from the 
previous section, it follows that: 

i\ \ f 



D = -W 



f 


( 






w 


<-§ 




V 





1 1 1 
1 1 1 
1 1 1 



1 1 1 



w 



w 



-w 



1 1 1 
1 1 1 
1 1 1 



1 1 1 



w 



w 



(a) 



Assume that the states are encoded so that the first state is the one with all qubits in the state (as mentioned 

before, this state is denoted by ). Then since M|0) = -^=(|0> + |1)) , it follows that W transforms to a superposi- 

72 



tion with equal amplitudes in all N states, i.e. W 



1 




i 





J_ 


l 









l 


Jn 









l 



. Since WW = /, it follows by multiplying both 





1 




i 




1 







sides by W, that W 


1 


= Jn 







1 








. Therefore: 
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1 . 


. 1 
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1 


1 . 


. 1 




1 





. 


. 




1 


1 


1 . 


. 1 










. 


. 










. 


. 


w 


1 


1 


1 . 


. 1 


w = Jn 








. 


. 


W = N 








. 


. 




1 


1 


1 . 


. 1 










. 


. 










. 


. 



(b) 



Substituting (b) in (a): 

( 



D = -W 



1-2 



1 





. 


. 








. 


. 








. 


. 








. 


. 



w = -\ 





-1 





. 


. 







1 


. 


. 


w 








1 . 


. 










. 


. 1 



w 



(c) 



The equation (c) is written as D = -WI-W , where /- denotes selective inversion of the state. 

7. Concluding remarks This paper has outlined the steps that went into the development of the quantum 
search algorithm. As with many new scientific developments, the steps are far from rigorous. Nevertheless, I have 
found these insights helpful in developing the search algorithm and its extensions. 

There have been several analyses of the search algorithm but there are still only partial answers to some 
basic questions which seems to suggest our understanding of this algorithm is still limited. The first question is: What 

is the reason that one would expect that a quantum mechanical scheme could accomplish the search in 0(jN) 
steps? It would be insightful to have a simple two line argument for this without having to describe the details of the 
search algorithm. 

It has been proved that the quantum search algorithm cannot be improved at all, i.e. any quantum mechani- 
cal will need at least O(jN) steps to carry out an exhaustive search of N items [4] [5]. Why is it not possible to 

search in fewer than 0( JN) steps? The arguments used to prove this are very subtle and mathematical. What is lack- 
ing is a simple and convincing two line argument that shows why one would expect this to be the case. 



8. Appendix - The quantum search algorithm is based on the assumption that it is possible to selectively invert 
the amplitudes in a basis state where the function f(x) evaluates to 1. As mentioned in section l(ii) such a circuit can 
be designed, provided we are given a quantum mechanical circuit to evaluate the function f(x) in any specified basis 
state x (Figure 1). The selective inversion circuit is shown in Figure 5. 
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Input 



x (n qubits) 



B (1 qubit) 



1 


" 1 " 


1 




-1 





| CNOT | 



x (n qubits) 
o/i Output 

B XOKf(~x) 



a 
b 

— c 
d 



Figure 5 - The above quantum mechanical circuit selectively inverts amplitudes of 
precisely those basis states where the function f(x) evaluates to 1. 



To analyze schematic quantum circuits, such as in Figure 5, one examines the transformation of the input basis states 
- and then by the superposition principle as described at the end of section l(i), the effect on any superposition can be 

obtained. It is easily seen in the above circuit that if for some w-qubit input basis state x , the output of the f(x) gate is 
1, the ancilla qubit superposition is transformed from (a, P) to (P, a) . If a is and p is — ^ , something very 

interesting happens. If the output of the f(x) gate is 0, the ancilla bit superposition stays unchanged; whereas if the 

output of the f(x) gate is 1, the ancilla qubit superposition is transformed from f ] into f — -7: \ . This is 

equivalent to changing the sign of the amplitude. Thus in any n-qubit superposition, the amplitude of precisely those 
basis states are selectively inverted for which the function f(x) is 1. 
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